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Convergence Analysis of Saddle Point Problems in Time Varying Wireless Systems - 
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Abstract 

Saddle point problems arise from many wireless applications, and primal-dual iterative algorithms 
are widely applied to find the saddle points. In the existing literature, the convergence results 
of such algorithms are established assuming the problem specific parameters remain unchanged 
during the iterations. However, this assumption is unrealistic in time varying wireless systems, as 
explicit message passing is usually involved in the iterations and the channel state information (CSI) 
may change in a time scale comparable to the algorithm update period. This paper investigates the 
convergence behavior and the tracking error of primal-dual iterative algorithms under time varying 
CSI. The convergence results are established by studying the stability of an equivalent virtual dynamic 
system derived in the paper, and the Lyapunov theory is applied for the stability analysis. We show 
that the average tracking error is proportional to the time variation rate of the CSI. Based on these 
analyses, we also derive an adaptive primal-dual algorithm by introducing a compensation term to 
reduce the tracking error under the time varying CSI. 

Index Terms 

Saddle Point, Convex Optimization, Convergence Analysis, Time- Varying, Lyapunov Stability, 
Network Utility Maximization 

I. Introduction 

Saddle point problems arise in a number of wireless communication applications such as resource 
allocations and competitive games. Resource allocations problems can be formulated as a constrained 
maximization of some utility functions. By constructing a Lagrangian function, the constrained 
problem can be reformulated into an unconstrained one and be solved by computing the saddle point 
of the Lagrangian function, where we maximize over the primal variables and minimize over the dual 
variables. Most remarkably, primal-dual gradient methods have been widely used for computing the 
saddle points of general Lagrangian functions, while they provide decentralized solutions in wireless 
applications. The primal-dual gradient methods update the primal and dual variables simultaneously 
by evaluating the gradient of both the primal function and dual function at the same time. A classical 
study of primal-dual algorithms has been done by Arrow, Hurwicz and Uzawa in their seminal work 
m, where they constructed a continuous-time algorithm for general concave-convex functions and 
derived the asymptotic properties using Lyapunov-like methods. They also provided a global stability 
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result on a discrete-time algorithm with constant step sizes. Recently, Feijer et al. ||2l, Q have 
studied the stability of these primal-dual algorithm dynamics and extended the results to various 
network resource allocations problems. 

In the above literature, when people discuss the convergence behavior of the primal-dual gradient 
algorithm, all the problem specific parameters are considered to be time invariant. Even in most other 
literatures studying optimization problems in wireless communications, problem dependent parameters 
are merely considered to be quasi-static, meaning that these time varying parameters are assumed to 
be constant for a sufficiently long time until the algorithms converge. However, this assumption is 
unrealistic, especially in wireless communication scenarios. For instance, the operating environment 
in terms of the channel state information (CSI) may be changing frequently, such that the optimization 
problem varies from time to time. On the other hand, as explicit message passing may be involved 
in the iterations, the optimization algorithms cannot always converge fast enough to catch up with 
time varying effects, especially for large scale problems. As is shown in our numerical example, 
convergence errors would lead to performance loss in a wireless communication system. However, it 
is yet unknown whether the algorithms converge or not when the quasi-static assumption is dropped, 
even for strictly convex problems. Therefore, it is highly important to the study the convergence 
behavior, or robustness, for primal-dual gradient algorithms under time varying CSI. 

However, towards this end there are a lot of technical challenges, some of which are listed below. 

• How to quantify the performance penalty due to the time varying parameters 

• How to evaluate the cost-performance tradeoff 

• How to enhance the algorithm 

These questions are highly nontrivial due to the stochastic nature of wireless communication 
problems as well as the complexities of the algorithms that solve them. There are some preliminary 
works studying the effects caused by time varying parameters. For example, for resource allocation 
problems, the authors in [4| had introduced stochastic perturbations to the algorithm to represent 
random loads entering the network, and their analyses were based on linearization on the system. 
The authors in ||5l studied a stochastic network utility maximization (NUM) problem with noisy 
feedback input to the algorithm. However, and |0 did not consider the CSI being time varying 
and hence their problems have static equilibrium points. In 0, the authors studied the performance 
of gradient algorithms for a game problem with moving equilibria, and the authors in Q, HI studied 
multi-cell CDMA interference games under time varying channels. While these work |l6l, ||71, IH 
provide some preliminary results for solving games under time varying CSI, their techniques cannot 
be applied to general saddle point problems. 

In this paper, we shall investigate the convergence behavior of primal-dual algorithms for solving 
general saddle point problems under time varying CSI. We first define an equivalent virtual dynamic 
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system, following which we study the stability of such virtual dynamic system based on the Lyapunov 
theory ||9l and the LaSalle's invariance principle argument lITOl . We model the dynamics of the 
time varying CSI as an auto-regressive system, and the stability results are derived by constructing 
Lyapunov functions along the algorithm trajectories. Based on the stability results, we derive the 
convergence properties of primal-dual algorithms. We also quantify the average tracking errors in 
terms of the average exogenous excitations induced to the CSI dynamics. Furthermore, to enhance 
the tracking performance, we propose a novel adaptive algorithm with distributive implementation 
solutions. As an illustration, we also consider two wireless application examples, based on which we 
show that the numerical results match with our analysis. 

The paper is organized as follows. In Section we introduce a general saddle point problem 
as well as the primal-dual algorithm with two application examples in wireless communication. We 
also introduce the time varying CSI model, and the virtual dynamic systems. Then in Section lin] 



and Section IV we analyze the convergence behavior of the primal-dual algorithm for the strongly 



concave-convex saddle point problem and degraded saddle point problem, respectively. In Section [Vj 



we propose a novel adaptive algorithm to reduce the tracking error in time varying CSI. Section VI 



gives the simulation results and discussions. Finally, we summarize our main results in Section VII 
Notations: Matrices and vectors are denoted by capitalized and lowercase boldface letters, re- 
spectively, {ai^) denotes the transpose of matrices (vectors) A (a), and denotes the complex 
conjugate transpose. M and C denote the set of real numbers and complex numbers, respectively, 
a ^ b denotes the entry-by-entry comparison and A^ B means that A — B positive semidefinite. 
Unless specified in the context, || . || denotes the Euclidean norm. 

II. System Model and Virtual Dynamic Systems 

In this section, we shall first introduce a general saddle point problem setup and illustrate with a 
few examples in the context of wireless communications. We shall then define the time varying CSI 
model and discuss its impact on the convergence of saddle point problems. Finally, we shall introduce 
the notion of virtual dynamic system and establish the equivalence between convergence behavior of 
the saddle point algorithm and the stability of the virtual dynamic system. 



A. General Saddle Point Problem 

We consider a min-max optimization problem 

mill max £(x, A; h) (1) 

where the function £ : x R™ i— > R has two vector variables. Moreover, £(x, A; h) is strongly 
concave in x G and convex in A G R^. h G ^ C R"? is a vector parameter that arises from 
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specific optimization problems. In the context of wireless communication optimizations, the problem 
parameter h can be the CSI. Under the above convexity assumption, the min-max problem has a 
unique optimal solution (x*, A*). Note that since the min-max problem in ([T]) is parameterized by h, 
the optimal solution (x*, A*) can also be represented as a mapping from h € ^ to (x*, A*) 

X* (h) = (h) , A* (h) = V'A (h) Vh G 1^ 

where tpx '■'H ^ M" and ipx -.T-L ^ are functions. 

It is known that solving the above min-max optimization problem ([T]) is equivalent to computing 
the saddle point of £(x, A; h) |11|. For a given h G 7^, a saddle point (x*, A*) of £(x, A; h) over 
the set R" x is defined to be a vector that satisfies 

/:(x, A*; h) < £(x*, A*; h) < £(x*, A; h), Vx G M", A G M™. 

It is also known that when C is strongly concave in x and convex in A, the saddle point is unique 
ifTTTl . meaning that the optimal solution to ([T]) is unique. 
The classical primal-dual gradient algorithm dynamic^ for solving ([1]) are given in the following: 



dt 

■ _ d\ 

A — 

dt 



|^£(x, A; h(t)) 



-|^^(x, A; h(t)) 



(2) 

T1 + 

(3) 

A 



for some step size k > 0. The projection [.\\ is to restrict A in the nonnegative domain W^. For scalars 
Ui and Aj, the projection is defined to be [ui\^_ := Ui if Uj > or Aj > 0, and [n,]^ := otherwise. 
For the vector case, the projection is defined entry-wide. This algorithm was first introduced by Arrow 
et al. yj and has been recently studied and applied to various resource allocations applications [3|. 
The asymptotic behavior has been analyzed by studying the Lyapunov function of the system state 
and applying LaSalle's invariant principle argument ||9l, ifTOll . 

In all these works, the convergence results were established based on the assumption that the 
wireless channel state h stays time invariant before the algorithm converges. In practice, the iterations 
in (|2])-([3]) may involve explicit message passing among nodes in a wireless network, and hence it is 
quite unlikely for the h to be invariant during the iterations. When the channel state h(t) is time 
varying, all the existing convergence results [Tl, ||3l fail to apply. In this paper, we shall address the 
situation where the CSI h(t) changes in a similar time scale as the primal-dual algorithm in Q-Q, 
which reflects a more realistic situation in practice. 

'in this paper, we assume the step size of iterations are sufficiently small so that the primal-dual iterations can be 
represented by the continuous time dynamics fTl. l3l. 
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B. Examples of Saddle Point Problems in Wireless Communications 

In this subsection, we shall illustrate various important applications of saddle point problems in 
the context of wireless communications. 

1) Application 1: Transmission and Jamming Strategy Optimization in MIMO Channels: Consider 
a point-to-point MIMO system with a jammer as shown in Fig. [T] The transmitter and the receiver 
have N antennas. The user X transmits the desired signal x ~ CAA(0, Q) to the receiver Y, while 
the jammer Z transmits a jamming signal z ~ CA/'(0, Z) to interfere Y. The received signal at Y is 
given by y = Hix + H2Z + n, where n ~ CA/'(0, cj^I) is the additive Gaussian noise, and the mutual 
information between X and Y is given by |[T2l 

C7(Q, Z) = logdet (l + {all + HaZH^^)"^ HiQHf ) . (4) 

Assume that both the user and jammer have perfect transmit CSI (CSIT), and they know each other's 
transmission covariance. Thus the user's strategy is to maximize the mutual information given in 
(|4]) based on the CSIT and the knowledge of the jammer's transmission covariance, while the the 
jammer's strategy is to minimize that mutual information. Their strategies result in finding the saddle 
point of the following saddle point problem |[T3l . 

Problem 1 (Transmission and Jamming Strategy Optimization): 

mill max C7(Q, Z; h) = logdet fl + (ct^I + HsZH?)"^ HiQHfV (5) 

tr{Z)<Pj tr{Q,)<PT \ \ ^ J ^ J 

where h = vec([HiH2]), Pt and Pj are the total power constraints for the user and jammer, 
respectively. 

Note that the saddle point problem in ([5]l is strongly concave in Q and strongly convex in Z 
|[T4l . and hence there exists a unique saddle point, which can be computed by a primal-dual iterative 
algorithm Q-Q. When the CSI h(t) is time invariant, the transmission policies for both the user and 
jammer will converge to the saddle point of ([5]), and a Nash Equilibrium is achieved. However, when 
the CSI h(t) is time varying in a similar timescale as the primal-dual iterations, the convergence is 
not guaranteed. Hence, it is interesting to study the convergence behavior of the algorithm iterations 
under time-varying CSI. 

2) Application 2: Network Utility Maximization of Wireless Ad Hoc Network: In a resource 
allocation problem, the communication network is modeled as a set of source nodes transmitting 
traffic through a set of links. Each source node has its own concave utility function and is allocated 
a portion of resources. The optimization problem is to maximize the total network utility under a set 
of constraints on the total network resources. The problem is formulated as follows Q, lITSll . 
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Problem 2 (Network Utility Maximization): 



subject to E(s,d):«GL{s,d) ^ sd < ci{pi; h) V/ 



(6) 



per 



where the ordered pair (s, d) denotes the traffic, each of which has a fixed route, that initiates 
from source node s and is delivered to destination node d, r^d denotes the data rate for the traffic, 
p = [pi . . . is a vector of power allocated to L links, Usd is the utility function to evaluate 
traffic rate r^^ for source s, and c;(.) is a link capacity function of the transmit power pi and channel 
gain (i.e. CSI) h. C denotes the collection of all traffic flows, L(s, d) denotes the set of links that 
traffic (s, d) goes through, and V denotes the set of feasible power vectors. We assume both Ugdi-) 
and q(.) are strongly concave and twice differentiable. Fig. |2] illustrates an example NUM problem 
with 3 nodes, 2 links, and |C| = 3 traffic flows. For example, the collection of traffic flows is 
given by C = {(1,2), (1,3), (2,3)}, and the sets of links are L(l,2) = {1}, L(l,3) = {1,2} and 
L(2,3) = {2}. 

For notation convenience, we denote x = [r ; p] as a vector of the primal variables. The Lagrangian 
of ([6]) can be written as 



subjected to the power constraint p £V. From the Lagrangian theory lUTI, solving the optimization 
problem ([6]) is equivalent to finding the saddle point of the Lagrangian function ([7]), 



which is a special instance of ([T]), and the primal-dual algorithm Q-Q can be applied to solve this 
problem. 

Note that as solving the dual problem involves explicit message passing among nodes, the conver- 
gence rate of the algorithm is critical to the solution quality of the NUM problem. Since it may not 
be justified in practice that the channel state h(t) is time invariant during the algorithm convergence, 
it is very important to study the convergence behavior of the primal-dual iteration in Q-Q when 
the CSI is time varying. 

C. Time Varying CSI Model 

In this section, we first introduce a dynamic model for the time-varying CSI h(i). Specifically, the 
CSI h(t) is modeled as a solution to the following dynamic equation 




(7) 




Lax £(r, p, A; h) 

i;pG7' 



(8) 



h{t) 



dh{t) 
dt 



A{h{t) - h) + u{t) 



(9) 
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where ^4 is a real symmetric negative definite matrix, u{t) is a vector valued complex Gaussian 
process with uncorrelated real and imaginary components, and h is a constant vector, corresponding 
to the line-of-sight (LOS) component in the channel model. The dynamic system can be viewed as an 
external disturbance u{t) being applied to an autonomou^ linear system h = A{h{t) — h). Note that 
the dynamic model ([9]) resembles an AR(1) process and |h(t)| has a stationary Rician distribution. 
As is summarized in |[T6l . the AR models have been successfully used to model, predict and simulate 
fading channel dynamics (e.g. in [|17I . Il6l . ifTSl ). the autocorrelation function of which was also 
observed and justified in |[T9l to be close to that of a Rayleigh fading process. 

Note that the dynamic system model of h in (|9]l is stable]^ ||9l- Firstly, if u{t) = 0, the dynamic 
system Q exponentially converges to the unique equilibrium point h = h. Secondly, when u{t) / 0, 
it is also stable provided that ||tt(t)P = Uuit^oo /o^ < oo. This can be verified by 

constructing a Lyapunov function L{h) = (h — h)^(h — h)/2 on the state h and applying the 
LaSalle's invariant principle argument JTOll . |[9l . The stabiUty property in the CSI model h.{t) is 
important because it is a necessary condition for the convergence of the primal-dual algorithm (|2])- 

D. Virtual Dynamic Systems 

In this section, we shall first define a virtual dynamic system. We shall illustrate that studying the 
convergence behavior of the primal-dual iterative algorithm for the saddle point problem in (|2])-([3]) 
is the same as studying the stability behavior of the virtual dynamic system. As a result, the virtual 
dynamic system forms a bridge between the convergence analysis of iterative algorithms and nonlinear 
control theory. 

Definition 1 (Virtual Dynamic System): Let x = (x, A) be the joint state of the primal and dual 
variables in algorithm (|2])-([3]) and let / : R" x i-> M"+'" be the vector valued function 



k(j|£(x, A; h{t))f 
£(x, A; h(t)))' 



dX 



(10) 



I A 

The state x of the virtual dynamic system satisfies the following dynamic equation 



X: X=|5(t) = /(X;h(t)). (11) 



Definition 2 (Equilibrium Point): x* is called an equilibrium point of the vector field x = /(x; h(t)), 
if 7(X*; h(t)) = 0. ■ 

^An autonomous system is a system that can be described by an ordinary differential equation x = f{x) where / does 
not explicitly depend on t 19). 

^The concept of stability is formally introduced in Definition in the following subsection. 
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Note that since the vector field x = /(x; h) is parameterized by the channel state h(t), the 
equilibrium point x*(h(t)) is expressed as a function of h.{t). From ( [TT] ) and Definition |2j we have 
the following result establishing the connection between the virtual dynamic system ( [TT] | and the 
primal-dual algorithm (|2])-([3]) for solving saddle point problems. 

Theorem 1 (Connections between the Virtual Dynamic System and the Saddle Point Problem): The 



equilibrium point x*(h(t)) of the virtual dynamic system ( ]_J_i is identical to the saddle point [x*(h(t)), A*(h(t))] 
of the saddle point problem ([T]). Furthermore, the algorithm trajectories of the primal-dual algorithm 
in Q-Q are the same as those of the virtual dynamic system ( [TT] ). ■ 
Proof: Please refer to Appendix [A] for the proof. ■ 

As a result of Theorem [T| the convergence behavior of the primal-dual algorithm Q-Q can be 
visualized by the stability of the equilibrium point in the virtual dynamic system ( [TT] ). For instance, the 
algorithm in (|2])-([3]) converges to the saddle point if and only if the equilibrium point of the virtual 
system ( [TT] ) is asymptotically stable. When the channel state h is time invariant, the equilibrium 
point x*(h) is stationary and the algorithm trajectory of the primal-dual iterations can be visualized 
to converge to the stationary equilibrium point. On the other hand, when the channel state \i{t) is 
time varying (with comparable time scale as the algorithm trajectory), the equilibrium point x*(h(t)) 
will be time varying and the convergence of the algorithm cannot be guaranteed. This is illustrated 
in Fig. [3] Due to this association, we shall focus on studying the stabiUty of the moving equilibrium 
x*(h(t)) of the virtual system ( [TT] ) in the rest of the paper. 

Due to the time varying CSI, the equilibrium point x*(h(t)) is non-stationary. For mathematical 
convenience, we consider a change of state variable. Define Xe(t) = x(t) — x*(t). Note that Xe(t) 
represents the instantaneous erroi|^ Let V'(h) := (x*(h), A*(h)) = x*(h) be a mapping from the 
CSI h G ^ to the equilibrium point of the virtual dynamic system \\ 1[ ). As Xg = x — x , The error 
process Xe(f) satisfies the dynamic equation of the error dynamic system defined below. 

Definition 3 (Error Dynamic System): The error dynamic system for the virtual system x = /(x; h(t)) 
is given by 

Xe-. Xe = 7e(Se; h(t)) - ip{h.)h{t) (12) 

Where USze, h{t)) := / (X^ + ^(h); h(i)) and ip{h) := ^^(h) . ■ 
Since the error state Xe(t) and the virtual system state x(t) are related simply by a linear transfor- 
mation, studying the stability of X is equivalent to studying the stability of X^.. Finally, we summarize 
the formal definition of as follows (91. 

''Using Theorem [I] Xe(i) also represents the error between the current solution (x(f), A(t)) driven by the iterative 
algorithm in ((2j-((3j and the instantaneous saddle point (x*(t), X*{t)). 
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Definition 4 (Exponential Stability of the Virtual Dynamic System): The equilibrium point x* = 
of ([12]) is exponentially stable if there exist some positive constants c, k, and A such that 

||5e(t)|| < k\\ie{to)\\e-^^'-'"\ V||5e(to)|| < c. (13) 

It is globally exponentially stable if ( [T3] ) is satisfied for any initial state Xe(to)- ■ 
Using Theorem [T] we have global asymptotic stability of the virtual dynamic system corresponding 
to the global asymptotic convergence of the primal-dual algorithm in the original saddle point problem 
and vice versa. 

III. Convergence Analysis of Strongly Concave-Convex Saddle Point Problems 

UNDER Time Varying CSI 

In this section, we shall focus on the convergence analysis of strongly concave-convex saddle point 
problems when the CSI is time varying. In this case, for the saddle point function C, the Hessian 
Vl£ ^ -M^I for > is negative definite and V^/^ h Mxl for Ma > is positive definite. We 
shall first establish the key stability results of the virtual dynamic system under a virtual exogenous 
input. Based on that, we shall discuss the impact of time varying CSI to the convergence of the saddle 
point problem, followed by some examples. 



A. Stability Analysis of Virtual Dynamic System 

We shall study the stability of the virtual dynamic system X^, in ([12]) associated with the primal-dual 
iterative algorithm of a strongly concave-convex saddle point problem in (|2])-([3]). We first consider 
the case when the CSI is time invariant (h = 0). The following lemma summarizes the key results. 

Lemma 1 (Stability of Xe with Time Invariant CSI): When the CSI h(t) is time invariant, the equi- 



librium point X* = of the virtual dynamic system X^ in ([12]) is globally exponentially stable for 

all hen. m 

Proof: Please refer to Appendix |B] for the proof. ■ 
The above convergence result is not surprising since the CSI is time invariant and similar results 
have been established in lUl, Q, ifTTl using different approaches. In the following, we shall extend 
the stability result to the system with time varying CSI. 



As an intermediate, we first consider a virtual dynamic system X^ in ( [12] ) with a quasi-time varying 
CSI h(i). Specifically, h(t) is time varying according to the CSI model in (]9]) with u{t) = 0, for 
all t. As a result, the CSI h(t) varies only during some transient and will converge to h as t — )• oo. 
Define a joint system state as Zg = (xg, he), where hg = h — h, the virtual dynamic system of state 
Zg can be specified as 
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(14) 



/e(Xe; he) + ip{he + h)Ahe 

where the A is the coefficient matrix in the channel model (|9]l. Note that z* = is an equilibrium 
point for Zg = Z{Z(,). The following lemma summarizes the stability results of the virtual system 

Lemma 2 (Stability of under Quasi-Time Varying CSI): Assume the following inequality holds 
for all hen, 

Mh)A\\ < Kmin{2M, -A„,ax(^)} (15) 

where Amax(^) denotes the largest eigenvalue of A, and M = m.m{Mx, Mx}. The joint virtual 
system Zg = Z{ze) has an exponentially stable equilibrium at z* = 0, and there exists a Lyapunov 
function V : ]R"+™+9 ^ R satisfying 



ai||Ze|P < yi'2'e) < «2||Ze||^ 



-— Z(Ze) < -as Ze 



dV 



< aAlze 



(16) 
(17) 

(18) 



for some positive constants oi, 02, as and 04 > 0. ■ 
Proof: Please refer to Appendix [C] for the proof. ■ 

Remark 1 (Interpretation of Lemma^: From Lemma |2| even though CSI h(t) is quasi-time vary- 
ing, we still need condition ( [TS] ) to guarantee globally exponential stability during the transient of h(t). 
In fact, the parameter k affects the convergence rate of the primal-dual algorithm, the eigenvalues of 
A represent the the transient rate of h(i) and the value ||(/?(h)|| can be considered as the sensitivity 
of the moving equilibrium x*(t) to the variation of h{t). The result of Lemma |2] illustrates that when 
the transient rate of h(t) is slow and the sensitivity of the moving equilibrium is small, the underlying 
virtual dynamic system still has global exponential stability. This property will be used to study 
the stability of the virtual system under time varying CSI. 

Finally, we consider the stability result of the virtual dynamic system Ze under the time varying 
CSI model in (|9]l. Specifically, the virtual dynamic system is given by 



Ze{u) : Ze 



/e(Xe; he) + ip(he + h)Ahe 
Ahe 

Z{Ze) + <^>iheMt). 



+ 



ifihe + h) 
I 



uit) 



(19) 
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Note that since u{t) ^ 0, the CSI h{t) is time varying even after the transient and its impact on 



the stabiUty of the virtual system is captured by the virtual exogenous input $(he)u(t) in ([T9]l. The 
following theorem summarizes the stability results for Zf.{u). 



Theorem 2 (Stability of Ze{u) for Time Varying CSI): Given ||<l>(he)|| < 7, ||ti(t)P < the 



average trajectory ||zeP of the virtual dynamic system Ze{u) in ( fTO) ) satisfies 



\\zef = lim ^ r \\Mt)fdt < "^a^ (20) 

where ||n(t)P = Huit^oo ^ Jq \\u{t)\\'^dt. ■ 
Proof: Please refer to Appendix [D] for the proof. ■ 

Note that ||u(t)p denotes the average exogenous excitation of u{t) to the virtual system Ze 
and ||zeP denotes the average tracking error of the system. Based on the result in Theorem [2| 
the following corollary summarizes the convergence performance of the primal-dual algorithm for a 
strongly concave-convex saddle point problem in time varying CSI. 

Corollary 1: (Convergence Performance of Primal-Dual Algorithms for Strongly Concave- Convex 



Saddle Point Problems): Suppose ||(/9(h)|| < 70 for all h G ?^ and ||u(i)P < a , the aver- 



age tracking error ||xeP of the primal-dual algorithm under time varying CSI satisfies ||xeP = 
lim^^oo ^ Jo Pe(t)pdt < ^^^a\ m 
Proof: Since ||$(he)|| = v'llv'lhe + h)P + 1 < ^7^ + 1, by Theorem |2| the average trajectory 
of the corresponding virtual dynamic system Ze{u) for the primal-dual iterative algorithm converges 
to ||zeP = limT_>oo Jq ||ze(t)p(it < "-i^"^"^"'"^) q^. As ||xe|| < ||ze|| the average trajectory ||xe(t)P 
of the error process converges to "■^^'^q^+^^ Q^^ and equivalently, the average tracking error ||xeP of 



-'3 



the primal-dual algorithm satisfies limT^oo 7 Jq \\^e{t)\\'^dt < ^^^^^i^Q^. This completes the proof 
of Corollary [T] ■ 



Note that the average tracking error ||xeP depends on a^, 70 and 04/03 which represents how fast 
the CSI changes, how sensitive the equilibrium x*(t) is w.r.t. the change of CSI h(t), and how fast the 



algorithm converges, respectively. For instance, the average tracking error ||xeP will be smaller if the 



CSI is slowly changing (i.e. ||u(t)P is small) or the convergence rate of the underlying primal-dual 
algorithm (under time invariant CSI) is fast (i.e. smaller 04/03). 



B. Numerical Example 

We shall illustrate the application of Theorem[2]using a numerical example from Section [n-Bl| Con- 
sider a 2-antenna system; the fading channels are given by Hi, H2 G C^^^, and h = vec ([Hi H2]) G 
C^. We choose the channel model in ([9]) as yl = — oJ, h = 1 and u{t) = ^/2aw{t), where w{t) 
is a zero-mean unit-variance white Gaussian process. This corresponds to a standard AR Gaussian 
fading process. The parameter o controls the time-correlation of the CSI and the CSI h(t) has unity 
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variance. The primal-dual algorithm for the specific example is given by 



dvec(Q) 

dt 

dvec(z) 

dt 



gyg^^C(Q, Z; h(t)) 



J 



(21) 



'avecr(z)^ (Q' ^' ^^(0), 

where Q = {Q G C^^"^ : Q ^ 0, tr(Q) < Pt} and J = {Z ^ C^^"^ : Z ^ 0, tr(Z) < Pj} are the 
feasible sets for the transmission covariance matrices, projections {.}g and are to restrict the 
searching directions within the feasible sets Q and J', respectively, and x = [vec^(Q) vec^(Z)]^. 
In this numerical example, the power constraints are set to be Pt = Pj = 10. 

Correspondingly, the error dynamic system Xe and the virtual dynamic system Ze{u) is given 
by ( [T2] ) and ^19\ respectively, with /(x; h(t)) defined in ( |2T] ). The equilibrium point is defined as 
7(5*; h{t)) = and the function <^(h) ^ |^5*(h) = (^1/(5*; h))'' |^7(5*; h). 

It can be numerically verified that || 99(h) || < 11.22 for all possible h G ?^ C C*^, and the condition 
(15 1 in Lemma [2] is satisfied by choosing k = I sec^^. Hence by Theorem [2] we conclude that the 
average tracking error y ||x(t) — x*(t)pdt converges to a|7^a^/a| with 7 = ^/[[(/^(h) P + 1 < 
11.26, = 16a and 04/03 = 0.222 (when the CSI parameter a = 0.02 ms"^). 



IV. Convergence Analysis of Degraded Saddle Point Problems under Time 

Varying CSI 



In Section III we have focused on the case where the objective function £(x, A) is strongly concave 
in X and strongly convex in A. This property provides exponential convergence rate to the saddle 
point (x*, A*) of ([1]). In this section we shall extend the results to deal with the case when one of 
the Hessians is only semidefinite. Without loss of generality, we assume Vie ^ -M^I for > 
and V\C >z 0, and this corresponds to the case where the objective function £(x, A) is strongly 
concave in x and only convex in A. We call such a saddle point problem the degraded saddle point 
problem and the associated virtual dynamic systems the degraded virtual dynamic systems. Examples 
of degraded saddle point problems include the primal-dual iterations (single time scale) of convex 
optimization problems. For instance, the Lagrangian function (|7]) of the NUM problem Q is strongly 
concave in the primal variables r and p but only convex in the LM A. 



A. Stability Analysis of the Degraded Virtual Dynamic System 

Similarly, we first consider the case when the CSI is time invariant (h = 0). For degraded saddle 
point problems, the exponential stability property of the associated virtual dynamic system in 
Lemma [1] does not hold. We have a partial exponential stability property summarized in the following. 
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Lemma 3 (Partial Exponential Stability of Xf. with Time Invariant CSI): When the CSI is time in- 



variant, the equiUbrium point x* = (x*, A*) = of the virtual dynamic system in ( [121 ) is partially 



exponentially stable w.r.t. x* for all h G "H, i.e., there exists a Lyapunov function Vx : 
defined on the virtual state Xp that satisfies 



Cl||Xe||^ < Vxi^e) < C2||Xe 



pn+m s 



dV 

dXe 



/e(Xe; h) < -CsllXelP 
< C4||Xe||. 



(22) 
(23) 

(24) 



The proof is similar to that in Appendix |bJ by simply taking a Lyapunov function 14 (xg) = 
2^x^Xe, and letting Mx = 0. For this Lyapunov function, ci = C2 = C3 = 2Mx and C4 = 1/k. 
Note that, instead of possessing exponential stability on the joint state x* = (x*, A*), the degraded 
virtual dynamic system just has exponential stability on x* as shown in ( |23l ). 

Similarly, we study the stability of the virtual dynamic system Xe = fei^e] — ip{h)h.{t) with 
a quasi-time varying CSI h.{t) for u{t) = 0. Since we have exponential stability only on x* but not 
on A*, we define a partial state as Ze = (xg, hg). As a result, the trajectory of the partial state Zg 
satisfies the following virtual dynamic system Ze : 

/e(Xe, Ae = 0; hg + h) + ifixiK + h.)Ahe 
Ahe 

^-£(xe + x*, Ae + A*; h{t))Y and V9,(h) = |iX*(h) = |iVx(h). 



2,, : 



Zize) :-- 



(25) 



where /e(xe, Ae; h(i)) =- y^-g^^ 
The following lemma summarizes the stability results for the virtual system Zg = Z{Z(.). 

Lemma 4 (Partial Exponential Stability of Zf. under Quasi-Time Varying CSI): Suppose the follow- 
ing inequality holds for all h G "^^i 



\^x^)A\\ < Kmm{2Mx, -Amax(^)} 



(26) 



where Amax(^) denotes the largest eigenvalue of A. Then the system in ( [25| ) is partially expo- 
nentially stable, i.e., there exists a Lyapunov function for the joint state Ze satisfying 



Ol||Ze|P < V{Ze) < a2\\Ze\\' 

dV 

V{Ze) = -^Z{Ze) < -aslIZe 
UZf, 



dV 



dZe 



< a4\\Zf, 



(27) 
(28) 

(29) 



Proof: Please refer to Appendix [E] for the proof. 
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Lemma |4] suggests that as long as the transient of quasi-time varying h(t) is not changing too fast 
(i.e. A has small eigenvalues) and the sensitivity of the primal part of the equilibrium x* w.r.t. the 
change of h(t) is small (i.e. small ||(/9a;(h)||), the virtual dynamic system Zg still possesses globally 
exponential stability on the partial state Zg = (xg, hg). 

Finally, we consider the stability result of the degraded virtual dynamic system under the time 
varying CSI model in ([9]). The degraded virtual dynamic system Ze{u) is given by 



ZJu) 



/e(Xe, Ae = 0; h) + (/7^(he + h)Ahe 
Ahe 

= Z{Ze) + ^xihe)u{t). 

The stabiUty result is summarized in the following theorem. 



ipx{he + h) 

I 



u{t) 



(30) 



Theorem 3 (Stability of Zf.{u) for Time Varying CSI): Given ||<l>^.(he)|| < 7^, and \\u 



a 



the average trajectory ||zeP of the degraded virtual dynamic system Z^^u) satisfies 

rT 



1 

T 



2rft < 







where ||n(t)P = limj'-i.oo 75 /q ■ 
The proof can be derived similarly from that of Theorem |2] using the property of Lemma [4] As 

a result, we summarize the convergence performance of the primal-dual iterative algorithm for a 

degraded saddle point problem in the following corollary. 

Corollary 2: ( Convergence Performance of Primal-Dual Algorithms for Degraded Saddle Point 



Problems): Suppose (/?x(h) < for all h G F and ||n(t)p < a^, the average tracking error ||xe 



for the primal-dual algorithm under time varying CSI satisfies ||xeP 
"1(51+1)^,2 



lim- 



00 



1 rT 



f 

T JO 



Xf 



-a . 



The proof can be derived similarly from that of Corollary [TJ by applying Theorem [3] and using 
ll^x(he)|| = V'll¥'x(he + h)||2 + l < VtJTT and ||xe|| < ||Ze||. 

Together with Lemma [4j the above theorem and corollary establish sufficient conditions on the 
convergence performance of the a primal-dual algorithm for a degraded saddle point problem under 
time varying CSL It shows that once the corresponding virtual dynamic system Zg = Zu{Z(., u{t)) is 



exponentially stable at z* = for u{t) = 0, the system is stable for general u{t) as long as 
is bounded. Correspondingly, the tracking error of the primal-dual algorithm under time varying CSI 
is bounded and scaled according to O 



u 



Note that, due to the degraded saddle point problem in the dual variable A, the average error bound 



a in 



Corollary jij does not include the dual variable Ae, and hence we cannot bound the 
tracking error of the dual variables. This is because due to the semidefinite Hessian matrix VxC >z 0, 
the dual variable A in the primal-dual iterative algorithm Q-Q may not converge exponentially fast 
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to the saddle point A*. However, this result is still meaningful because, in many applications such as 
constrained optimizations, we are mainly concerned about the behavior of the primal variable x. We 
shall illustrate with an example in the next section. 

B. Numerical Example 

We illustrate an application of Theorem [3] using a numerical example in Section II-B2 Consider 



a simple wireless network with 3 nodes depicted in Fig. |2j where the collection of traffic flows is 
given by C = {(1,2), (1,3), (2,3)}, and the sets of links are L(l,2) = {1}, L(l,3) = {1,2} and 
L(2, 3) = {2}. Pti and Pt2 denote the total power allocated to link 1 and link 2, respectively, and hi 
and /i2 are the corresponding channel gains. Let h = [hi /i2]^- Our CSI dynamic is modeled in (|9]l, 
with A = —al and u{t) = \/2aw{t) where w{t) is a zero-mean unit- variance white Gaussian process. 
The variance of the CSI h(t) is normalized to unity and the parameter a controls the time-correlation 
of the CSI. The average receiving SNR is normalized to 10 dB. We consider Usd{fsd) = log(l + rsd) 
as the cost function, and the link capacity function is given by ci{Pu, hi) = log{l + hf Pu) , V/ = 1, 2. 
The Lagrangian saddle point problem for this specific example is given by 

minmax£(r, A) = J] log(l+r,d)- J] A J J] r,^ - log(l + hfPu) + J] xi'^'^.a. 

^ (s,d)eC 1=1,2 \{s,d):l£L{s,d) J (s,d)£C 



The error dynamic system and the degraded virtual dynamic system are given by ( [T2| ) and p5) , 
respectively. Specifically, let r = [ri2 ri3 r23]^ be the primal variable and A be the collection 
of the dual variables. Then Xe = x — x*, x = (r, A), Zg = (r, h), <I>r(h) = [93r(h); /], and 
(/7(h) = T^x*(h) = (§£) ^ which is derived by the implicit function theorem, where the imphcit 
function F{r*, A*) = is given by the KKT conditions 

Vr/:(r*, A*) 

AF iEKd-ciiPtuhi)) 



F{r*, A*) 



Numerical results suggest that ||(/7r(h)|| < 2.2, and hence the constants in Lemma |3] can be 
chosen as k = 0.5 sec^^, such that condition ([26]) in Lemma [4] is satisfied. Thus we can apply 
Corollary [2] which shows that the average tracking error for the rate allocation variables ||reP = 
liniT^oo T lo l|re(i)f < f TrO^ where 7^ < ^||(^^(h) P + 1 < 2.42, = 4a, and 04/03 = 
1.69 (when the CSI parameter a = 0.04 ms^^). 

V. Adaptive Compensation for the Primal Dual Algorithms in Time Varying 

Channels 

In the previous sections, we have analyzed the convergence behavior of the primal-dual algorithms 
under time varying CSI. In this section, we consider modifying the primal-dual iterations to reduce 
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the tracking error in time varying CSI. To this end, we shall introduce a compensation term in the 



iteration to offset the exogenous excitation to the virtual dynamic system in ( [T9| ) and ( [30| ). In the 
following, we shall first discuss the construction of the compensation term. Based on that, we shall 
obtain a low complexity distributive implementation for the compensation method. 



A. Adaptive Compensation for the Primal Dual Algorithm 

We have shown in Theorem [2] and Theorem |3] that the average tracking error of a primal-dual 
algorithm under time varying CSI is 0(0^7^) where 0^7^ is the square average of the norm of 
the exogenous input <I>(he)u(t) to the virtual dynamic system Zf.. Consider a particle trapped in 
an energy well given by the Lyapunov functiorj^ V{ze). The presence of the exogenous input term 
$(he)u(t) to the system Z^. excites the particle and moves it away from the equilibrium position, 
and how far the particle can be moved depends on the exogenous excitation energy induced by u{t). 
This phenomenon is illustrated in Fig. |4] As a result, one way to reduce the tracking error is to 
compensate the disturbance from the exogenous input <I>(he)n(t). We thus introduce a compensation 
term <I>(ze)u(t) to the virtual dynamic system Ze{u) in (fTO]) as follows. 



where $(ze) 



Zeiu) : 



Ze = Z{Ze) + '^(he)n(t) - $(Ze)u(t) 



(31) 



is the compensation term to be derived. Obviously, if we could set 



$(ze) = <&(he), the impact of the exogenous input $(he)ii(t) can be totally suppressed and the virtual 
dynamic system Ze (n) will be free from exogenous excitation. However, since we do not have closed 
form expression for the saddle point x*(h), <I>(he) cannot be obtained during the iteration. 

On the other hand, suppose that there exists a function F : x ]R"+™ e such that 

F(x*; h) = and ^-F(x*; h) is non-singular for all x*(h) and h € H C R'^. Using the impUcit 
function theorem, the function ^{hf.) in ( |3T] ) is given by 

- [JlF(X*(he + h); he + h)]"'|^F(X*(he + h); + h) 



(32) 



Note that the function F{.) can be found by the optimality conditions of the saddle point problem. 
For example, it can be chosen as /(x*; h{t)) = by the definition of equilibrium point (Definition 



2|, where /(.) is given in (fTO]). Using ([32]), we can estimate ^{he) by 



- Xe; h(t))) ' - Xe; h(t)) 



^(x(t); h{t)) 
I 



(33) 



The Lyapunov function can be interpreted as tlie energy function of the system state Ze 
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Note that the joint state Zg = (xg, hg) and x(t) are the algorithm outputs at the t-th algorithm 
iteration in (|34|)-([35]). The estimate ^{z^) in ( [33] ) is quite accurate when the algorithm trajectory x(t) 
is close enough to the equilibrium x*(h(t)) (i.e. ||xe|| is small enough). As a result, the corresponding 
primal-dual algorithm iterations with compensation is given by 

" d 



A 



dx 
"(it 

dX 

~(E 



;£(x, A; h(t)) + (^x(x, A; h(t))^h(t) 



d: 



'X 



d_ 



£(x, A; h(t)) + (^a(x, A; h(t))^h(t) 



(34) 



(35) 



where (^x(x, A; h(t)) and (^a(x, A; h(t)) are the primal and dual parts of the compensation term 
(^(5; h{t)). 



The compensation terms in p4])-(|35|) can also be interpreted as a predictor on where the saddle 
point X* (h) moves as illustrated in Fig. |5] We summarize the performance of the proposed algorithm 
in the following theorem. 

Theorem 4: {Tracking Performance of the Compensated Primal-Dual Iteration for Strongly Concave- 
Convex Saddle Point Problems): Suppose that the compensation term ^(x; h(t)) is Lipschitz con- 
tinuous on X satisfying ||^(x(t); h(t)) - ^(x*(t); h(t))|| < L||xe(t)|| for all h(t) G H C and 

1 rT 



\{t) G M™, and ||n(t)|| = limT-s.oo ^ Jq 11^^(011^^ = P < ^fi- Then the average tracking error ||x, 
of the proposed algorithm (|34])-([35]l converges to 0. 
Proof: Please refer to Appendix [F] for the proof. 



Similarly, for a degraded saddle point problem, the same proposed algorithm p4|)-p5) can be 
applied, and the degraded virtual dynamic system Ze{u) is given by 



where 



(36) 



^x(x, A; h(t)) 
I 

and the associated performance is summarized below. 

Corollary 3: (Tracking Performance of the Compensated Primal-Dual Iteration for Degraded Sad- 
dle Point Problems): Suppose the compensation term ^x(x, A; h(t)) is Lipschitz continuous on x 
satisfying ||^x(x(t), A(t); h{t) - (?x(x*(t), A(t); h(t))|| < L^\\yie(.t)\\ for all h{t) G H C and 

T /o'" 1 1^(^)1 1 '^^ = /3 < (iff^- Then the average tracking error 



A(t) G M?, and ||u 



lini' 



oo 



||xeP of the proposed algorithm (|34l)-([35|) converges to 0. ■ 
Corollary [3] can be obtained from Theorem |4] in a straight-forward manner by replacing the variable 
Zp with Zp. 
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B. Distributive Implementation 

The proposed algorithm (|34l)-(|35]) discussed in the previous section requires centralized computation 
for the compensation term ^(x; h(t)). However, for a wireless network, distributive solutions are 
usually desired. One way for distributive implementation is via message passing (i.e. communicating 
with other nodes to obtain non-local variables), while each node computes a version of (^(x; h(i)) 
based on the global knowledge. However, this involves a significant communication overhead and 
consumes a lot of computing resources. In the following, we propose a low complexity distributive 
implementation for computing the compensation term. 

Consider an ad hoc network topology as depicted in Problem |2] in Section II-B2 To facilitate 



distributive implementation of the compensated primal-dual algorithm p4])-([35|), we partition the 
whole network into G groups of nodes and partition the collection of optimization variables x(t) = 
[Xi(i)|52(t)| . . . \ScGit)] accordingly. Let B{t) = h{t)) and K{t) = h(t)). We 

then impose a block diagonal structure to the matrix B{t) and partition K{t) as 

Bit)=h\kdmg{Bsi,{t),Bs,,it),...,Bsi,{t)}, ^(tf = [K^ (t) | | ...\KT^{t)] (37) 

where Xj(t),Vi = is the collection of variables for the i-th group of nodes, i?x, (t) = 

^Fx. (x(t); h{t)) is a block of B{t) for the corresponding variables Xj(t), and K^. (t) = Jj^-^x, 
is a block of K(t) for the corresponding variables Xi(t). Therefore, the compensation term for the 
variable Xj(t) is just ^x^(t) = —B~^{t)Ksi.{t). For example, for a NUM problem with a topology 
depicted in Fig. |2] the network can be partitioned into two parts according to the two links, where 
xi = {ri2, ri3, Ai} and X2 = {r23, X2} (i-c Xi are the corresponding dual variables). The distributive 
update equation for the collection of variable Xj is 



X,; 



(VxX(x; h(t)) + (?x.(5; m)f 



I yi = l,...,G (38) 



where Vx.£(.) denotes the searching direction by the gradient of and [.]- denotes the corre- 
sponding projection for the variable Xj. 

As the terms i?x^ (t) and K^^ (t) just involve local variables, i^x^ (t) can be computed locally using 



only local information and hence the update equation p8| ) can be implemented distributively. Note 
that in wireless communication networks, nodes that are far away from each others usually have 
weak connections, and hence B{t) is actually a sparse matrix with dense diagonal entries. Therefore, 
using the block diagonal version B{t) in ( [37] ) will not contributes too much performance loss in the 
proposed algorithm. 

VI. Results and Discussions 

In this section, we shall simulate the tracking performance of the primal-dual algorithms for various 
saddle point problems we studied before. We evaluate the performance of the proposed algorithms 
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with the adaptive compensation term compared with the baseline algorithms in IS, EOl . lIlTl . The 
system performance loss due to the time varying channels and the tracking errors is also illustrated. 

A. Convergence Performance of the Strongly Concave -convex Saddle Point Problem and the Degraded 
Saddle Point Problem 

The simulations of the convergence performance for the strongly concave-convex saddle point 



problem and the degraded saddle point problem are based on the examples described in Section III-B 
and Section |IV-B[ respectively. The average tracking error in the jamming problem is defined as 
? = E [||X(t) where = vec([Q(t) Z{t)]), while = E [||r(t) - r*{t)f] defines 

the average tracking error in the NUM problem, where r(t) is the rate allocation variables. In both 
cases, the CSI model is specified by an AR process h = — o(h(t) — h) + ^/2aw{t), where h = 1, 
w{t) is a zero-mean unity-variance Gaussian process and a is a parameter to synthesize the fading 
rate. Fig. [v] and Figjs] illustrate the average tracking errors versus the fading rate parameter a in 
both of the problems. As a increases, the CSI changes more rapidly and the tracking errors of the 
primal dual algorithm increase. Moreover, for the same parameter a, the larger the average receiving 
SNR is, the larger tracking error will be. 

B. Convergence Performance of the Proposed Algorithm with Compensation 

To evaluate the algorithm performance in a more concrete communication scenario, we consider a 
NUM problem of a wireless ad hoc network with 6 nodes 8 links and 8 data flows as illustrated in 
Figj6] The data flows are delivered simultaneously with fixed routes. Links from the same transmitting 
node occupy different subbands while the interference at each receiving node is handled by multiuser 
detection (MUD) techniques. The problem is to optimize the transmit power under the time varying 
channels to support required traffic data rates. Similar ad hoc network scenario is also considered in 
|[22l . Similarly, the CSI model is specified by the AR process h = — a(h(t) — h) + ^/2aw{t), where 
a is a parameter to determine the CSI fading rate. 

The proposed algorithms with adaptive compensations are compared with the following baseline 
algorithms. 

• Baseline 1 - Conventional Primal-Dual Gradient Algorithm (ConPDGA) HI, 0: The primal 
and dual variables are updated simultaneously in Q-Q in the gradients of the objective function. 

• Baseline 2 - Averaging Primal-Dual Gradient Algorithm (AvgPDGA) |[20l|: The primal and 
dual variables are updated in the gradients of the objective function, and the approximate primal 
solutions are generated by averaging over the past primal solutions. 

. Baseline 3 - Perturbed Primal-Dual Gradient Algorithm (PerPDGA) im, 123: The primal 
and dual variables are updated in the gradients evaluated at perturbed points that are generated 
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from current points via auxiliary mappings. 

Proposed Primal-Dual Algorithm - Centralized: The primal and dual variables are updated 



in a compensated direction in (|34|)-(|35) to offset the potential movement of the saddle point. 
Proposed Primal-Dual Algorithm - Distributed: The primal and dual variables are updated in 



a compensated direction in ([38]) with imposed block diagonal structure in ( [37| ). 
Fig. [9] shows the comparison of the average tracking error versus a in the time varying CSI 
model. The tracking errors have been reduced greatly after introducing the compensation terms in 
both of the proposed algorithms. 



C. Performance Loss due to Tracking Errors 

In this section, we consider the degradation of the system performance (network throughput) and 
the CSI fading rate. Due to the time varying channels, the problem constraints (e.g. the link capacity 
constraints) may not be satisfied at every time slot. As a result, packet drops may occur and we define 
the system performance as the average actual throughput given by: 



= E 



kaK \{s,d)&T{k) 



where /C denotes the set of receiving nodes, T{k) denotes the set of traffic flows arrived at node 
k, Cfc denotes the vector of link transmission rates of the links towards node k, Cfc(.) denotes the 
capacity region at receiving node k and 1(.) is the indicator function to account for the penalty of 
constraint violation due to time varying CSI. For example, at node /c = 5 in Fig. |6j the set of traffic 
flows arrived is T(5) = {?'22,'^42,'^ii} and 65 = (rsjfy) is a rate vector corresponding to transmission 
rates fs and rj allocated to link 5 and link 7, respectively. 



Fig 10 shows the the average network throughput versus the channel fading rate a. The average 
throughput decreases when the fading rate a increases, as indicated by a growing average tracking 
error in Fig|9] The results also show that the proposed algorithms significantly outperform over all 
the other baselines. 



VII. Conclusions 

In this paper, we have analyzed the convergence behavior of the primal-dual algorithm for solving a 
saddle point problem under time varying CSI, which was modeled as an AR process. The convergence 
results have been derived by studying the stabilities of the equivalent virtual dynamic systems based 
on the Lyapunov theory from the control theoretical approach. We showed that for both the strongly 
concave-convex saddle point problem and the degraded saddle point problem, the average tracking 
errors were given by 0{a'^), where represents the average power of the exogenous excitation 
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induced to the CSI dynamics. Based on these analyses, we have proposed a novel adaptive primal- 
dual algorithm with a predictive compensation to counteract the effects of the time varying CSI. 
We showed that the average tracking error of the proposed algorithm converges to zero despite 
time varying CSI. Numerical results were consistent with our analysis and the proposed algorithm 
demonstrated significantly better convergence performance over the baseline schemes. 

Appendix A 
Proof of THEOREM[r| 



Proof: The equilibrium point x*(h(t)) of the virtual dynamic system A' in ( satisfies /(x*(h(t)); h(t)) 
0. By Definition [T] it is equivalent to 

|^£(x*, A*; h(t)) = 0, A,^£(x*, A*; h(t)) = 0, Vz (39) 

which is the optimality condition for the saddle point problem. As the saddle point problem ([T]) is 
strongly concave in x, the saddle point [x*(h(t)), A*(h(t))] is uniquely determined by ([39]l, which 
proofs that the saddle point and equilibrium are identical. 

From Definition [T] the virtual dynamic system ( [TT| ) and the primal-dual algorithm Q-Q share the 



same formulation. Hence the trajectories of the two are the same. ■ 

Appendix B 
Proof of Lemma[T] 

Proof: Take F(xe) = 2^xjxe as a Lyapunov function defined along the trajectory of the virtual 
dynamic system ( [T2] ). Suppose that Xg + x* is in the interior of the domain x R™. From 



V/e(Xe; h) = K 



Vie V^xC 

-v^/: -vie 



and the strong convexity / concavity properties V^C ^ —M^I and V|£ >z M\I, we obtain 
Then V{0) = and for any Xe / 0, 

1 • 1 ~ 

y(Xe) = Vy(Xe)Xe = -X^Xg = -X^/e(Xe; h) 

K K 



= -Xf /e(0+; h) + -i^ /' V/e(eXe; h)^^^^ 
K K Jq 

1^ Jo ^ 



< -2M||xJP 



where xj/e(0+; h) =< x — x*, V£(x*; h) >< from the optimality condition of the original saddle 
point problem. Here for simpUcity, we denote V£(x; h) = [^C{x, A; h) — g^£(x, A; h)]^. 
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Therefore, using the Lyapunov theory ||9l, we conclude that the virtual dynamic system is expo- 
nentially stable for every time-invariant h. ■ 



Appendix C 
Proof of Lemma[2] 

Proof: One possible choice of Lyapunov function on virtual state Zg along the algorithm trajectory 

(ze) = ^xjxe + ^hjhe- Then we have 

11 1 1 11^ 



min{^,^}||zef <y(ze 



Xell +-||he|| <max|; 



2k 



and 



dV 
die 

1 



Zfzp 



7e(xe; he + h) + (^(hg + h)Ahe 



-Xe/e(Xe; h) + hMh + -Xe(/7(h)Ah 

< -2M||Xef + A,„ax(^)||hf + -||5e||||<^(h)^||||h|| 



< 



min{2M, -XmU^)} - -M^)A\\ 



where as = min{2M, — Amax(^)} — > by the condition in (15 1. In addition, we have 

dV 



dze 



< A/il|xeP + ||hP < max{-, l}||ze 



Therefore, using Lyapunov theory [9|, we conclude that the joint system is exponentially stable and 
([T6]l-([T8]) are satisfied with ai = min{l/2K, 1/2}, a2 = max{l/2K, 1/2}, 03 = min{2M, -Amax(^)}- 
^||((i?(h)A|| and 04 = max{l/K, 1}. ■ 



Appendix D 
Proof of Theorem[2] 

Proof: From Lemma |2| there exists a Lyapunov function V(ze) defined on the joint state z^ 



along the trajectory Zg = Z{ze), satisfying ([T6l)-([T8|). Then, for the system (19\ , 

dV ~ ^ dV _ 9 

y(Ze) = — Z(Ze) + 7^^$(he)n(t) < -a3||Ze|r + a4||$(he)||||Ze||||'U 

Suppose there exists e > 0, such that 

lim i f\\ie{r)rdr>%W + e 
T->oo i Jo O3 

Then given any t > 0, there exists To(t) > 0, such that for all T > Tq, 

rt+T „2 



t + T Jq ai 



(40) 
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and there exists Ti{t) > 0, such that for all T > Ti, 

1 /■* f 

— / \\ZeiT)fdT < -. 



Choose < 7] < f-;^, and then from = Uuit^oo /q ll^^l*)!!^*^* < there exists 

T2{t) > 0, such that for all T > T2, 



TJt 



t+T 



\uij)\?dT 



t + T 1 
T t + T 



t+T 



\u{T)fdT < 



t + T 



[a 



(41) 



Finally, let Tsit) = max{To{t),Ti{t),T2{t)}. Then for all T > T3, 



1 



t+T 



t + T 
T 



1 



t+T 



t + T\J, ■■^^(^)ll'^^- I ll^e(r)fdr 



> 



t + T f al 2 2 ^ 



T Vaf " + 
(42) 



Hence the T-step Lyapunov drift is given by 

V{Ze{t + T))-V(Zeit)) 

ct+T r-t+T 

< - I a3\\Mr)fdT+ / a47l|ze(r)||||n(r)||c?r 



< -03 



|u(T)||2dr 



2e(T)PdtT 



< 



(""V (i^'"' + 1) + + a47 + V)it + T)^ ^ J^'^'' WMrWdtT 



< 

by ( [4T] ) and (|42]). By the property of the Lyapunov function V{ze) in ([16]) , for any given t > 0, we 
have ||ze(t + T)|| < ||ze(t)|| for all T > T3. Hence, we have lim^-^oo ||ze(i)|| = 0, which contradicts 



to the assumption in ( |40] ). So the average trajectory ||zeP < ^7^a^, which proofs the stability. 



Appendix E 
Proof of LemmaH] 

Proof: A simple choice of Lyapunov function is V{ze) = Vb(xe) + ^hjhe, where Vb(xe 



K(Xe)L^=o = and = i[x^ 0^]. Hence we have 

min{^, ^}||ze|p < ci||xe|p + ^l|h|P < V{ze) < C2||xe|p + ^l|h|P < max{^, ^] 
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and 



dt 



dV{ze) die 
dze dt 



aXe £ 



/e(Xe; he + h) + (/?(he + h)Ahe 



_ /e(Xe; he + h) + ^^^^h^ + h^Ah^ 

axe axe 

< -CsllXell^ + C4||Xe||||(/?x(h)^||||he|| + Amax(^) ||he|P 

< - [min{c3, -Amax(^)} - C4||((Cj.(h)A||] ||ze|p 
^ II Il2 

where C3 = 2Mx and C4 = 1/k following the discussion in Lemma [5] Moreover, 

5y(ze 



5Zp 



9Xc 



< A/clllxeP + ||h||2 < Wmax{^, l}||z 



Therefore, using the Lyapunov theory ||9l, we conclude that the system Zg = 2^(ze) is exponentially 
stable for the joint state Zg = (xg, h) and (|27])-(|29l) are satisfied with ai = min{l/(2K), 1/2}, 
02 = max{l/(2K), 1/2}, 03 = min{2M^, -Amax(^)} - ^||(/?a;(h)^|| and 04 = ^max{l/K2, 1}. ■ 



Appendix F 
Proof of Theorem|4] 

Proof: Consider the virtual dynamic system with compensation in ( |3T] ). Since 

^(X*; h(t)) -^(X; h(t)) 



$(he) - ^{ie) 







then ||$(he) - <5(ze)|| = ||^(x*; h(t)) - ^(x; h(t))|| < L||xe|| < i||ze||. Applying the Lyapunov 
function F(ze) in Lemma [2] to the virtual dynamic system Zf.{u), we obtain 

dV ~ ^ dV _o -~ 

y(Ze) = 7^^(Ze) + 7v^$(he)n(t) < -03 ||Ze f + 04 1| $(he) - $(Ze) || || n(t) || ||Ze || 

< -a3||ze|p + a4L||u(t)||||ze||^ 



Suppose there exists e > 0, such that 



lim — 



i+T 



Izefrllrdr > e 



(43) 



Then there exists To{t) > 0, such that for all T > Tq, ^ ||ze(T)p(ir > §. 

As liuiT-^oo Jf'^'^ \\u{T)\\dT = f3 < we choose r] = (^-^ — /2- Then there exists 

Ti{t) > 0, such that for all T > Ti, ^ //+'^ \\uiT)\\dT < /3 + ri< 

Observed that, Ze{t) only depends on all the realizations before u{t-). Therefore and 



are independent and hence. 



lim — 

T->oo T 



t+T 



\u{T)\\\\ie{T)fdr 



lim — 



t+T 



\u(t) Mt 



T 



t+T 



\Mr)fdT. 
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Therefore, if we choose any < 5 < 7/e/2, then there exists T2{t) > 0, such that for all T > T2, 

1 ft+T 1 j-t+T 1 pt+T 

- \\u{r)\\\\i,{T)fdT<- \\u{T)\\dT.- \\i,{T)fdT + 6. 

As a result, let Ti{t) = max{ro(t), ri(t), r2(t)}, and then for all T > T3, the T-step Lyapunov drift 
becomes 

rt+T pt+T 

V{z,{t + T))-V{z,{t)) < - as\\z,{T)fdT + a4L ||n(r)||||ie(T)f dr 

Jt Jt 

rt+T / 1 f-t+T pt+T 

< - I a4ze{T)fdT + a4Li- I ||n(r)||(ir / \\ze{T)fdT + 6T 

< \-a-i + a4,L. — J ||ii(r)||(ir J J \\ze{T)\\'^dT + a4,L6T 



t+T 

< —ttiLrj I \\ze{T)\\'^dT + a4L6T 

< (-r/| + 5)a4Lr 

< 



By the property of the Lyapunov function V{ze) in (I61, given any t > 0, we have ||ze(i + T)|| < 



|ze(t)|| for all T > T3. Hence, limt__>oo ||ze(i)|| = 0, which contradicts to the assumption in ( |43] ). So 



the average trajectory ||zeP converges to 0, and hence the tracking error ||xeP converges to 0. ■ 
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UserX 




Jammer Z 



Receiver Y 



Figure 1 . A system model for a point-to-point MIMO with a jammer. Tiie user X transmits signal to Y througii tlie ciiannel 
Hi and a jammer Z transmits a jamming signal to Y through the channel H2. 




''12' ''13 



Link 1, P 



ti 




Link 2, 




Figure 2. A specific example of the multihop wireless network with 3 nodes and 2 links, where the collection of traffic 
flows is given by C = {(1, 2), (1, 3), (2, 3)}, and the sets of links are L(l, 2) = {1}, L{1, 3) = {1, 2} and L(2, 3) = {2}. 
Pti and Pt2 denote the total power allocated to link 1 and link 2 respectively. 



Quasi-static CSI h(t): 
Static Saddle Point 



Algorithm Trajectory S(t) 




Time-Varying CSI h(t): 
Moving Saddle Point SX^Ct)) 



Algorithm Trajectory x{t} 



> X 




Figure 3. An illustration of the convergence behavior of the primal-dual algorithms. In (a), the equilibrium point in the 
equivalent virtual dynamic system is asymptotically stable when the CSI h{t) is quasi-static |1|. However, in (b), when the 
CSI h{t) is time-varying, the equilibrium point x*(t) becomes also time-varying and the convergence is not guaranteed. 
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Viz,) 




Figure 4. An illustration of the excitation due to the exogenous input $(he)it(t) for the equivalent virtual dynamic 
system Ze. We can visualize the Lyapunov function V(ze) as an energy function. Without the time- varying channel effect 
{u{t) = 0), the algorithm trajectory will eventually converges to the equilibrium z* = (corresponding to exponential 
convergence of the primal-dual iterations in l[2](-l[3]() because there is no exogenous excitation energy applied to the system. 
With the time-varying channels, there is an uncertainty region (shaded region) where the system state will converge to 
(corresponds to the convergence region in Corollary [I] and in Corollary [2}. The region is bounded and the size 
depends on the "energy" applied to the system due to the exogenous input <t{\\e)u{t). 




, ^Proposed Searching Direction 
{with Compensation) 



Figure 5. An illustration of the proposed algorithm with adaptive compensation to improve the convergence behavior of 
the primal-dual iterations in time-varying channels. Instead of searching via the steepest descent direction V/I(x, h) in 
the conventional primal-dual algorithm, it searches in a compensated direction V£(x, h) + ^(x, h) to offset the potential 
movement of the saddle point x* [t) . 




Figure 6. A specific example of wireless ad hoc network with 6 nodes 8 directed links and 8 data flows. The data flows 
are delivered simutaneously with fixed routes. Links from the same transmitting node occupy different subbands and do 
not interfere with each other. The interference at each receiving node is handled by multiuser detection (MUD) techniques. 
Allowable transmission rate for each link is determined by the capacity region at the corresponding receiving node. 
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Figure 7. The convergence performance of the primal-dual algorithm for the strongly concave-convex saddle point problem 
given in Section III-B The CSI model is given by h = A{h — h) + y/2aw{t), where A — —al, h = 1, and 'w{t) is a 
zero-mean unit-variance white Gaussian process. The variance of the CSI h{t) is normalized to unity, and the parameter a 
controls the time-correlation of the CSI. It shows that as the SNR increases, the tracking error also increases for the same 
CSI variation rate in terms of a. 




Figure 8. The convergence performance of the primal-dual algorithm for the degraded saddle point problem given in Section 

a(h — h) + \/2aui(f), where h = 1 and wit) is a zero-mean unit-variance white 



IV-B 



The CSI model is given by h ; 

Gaussian process. The variance of the CSI h(t) is normalized to unity, and the parameter a controls the time-correlation of 
the CSI. It shows that as the SNR increases, the tracking error also increases for the same CSI variation rate in terms of a. 
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Figure 9. The convergence performance comparison of the proposed algorithms and the baseline algorithms |3l. 1211 . 1201 . 
which solve a NUM under the wireless network in Fig[6] The CSI model is given by h = — a(h — h) + \f2aw{t), where 
h = 1 and w{t) is a zero-mean unit-variance white Gaussian process. The variance of the CSI h(t) is normalized to unity, 
and the parameter a controls the time-correlation of the CSI. The tracking errors of the proposed algorithms are much 
smaller than those of the baselines. 
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Figure 10. Average network throughput versus the channel fading rate parameter a, while applying the various proposed 
and baseline algorithms to solve the NUM problem under the network in Fig|6] The average throughput decreases when 
the fading rate a increases. The proposed algorithms have much higher network throughput than all the other baselines. 
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